; Title of the Invention 

CARRY SAVE ADDER S 

i 
i 

Field of the ' Invention 

The present' invention relates to carry save adders and in 
particular tut not exclusively to carry save adders wh].ch are 
able to reduce the number of partial products to two. 

Background to the Invention 

In digital arithmetic, partial products are obtained when two 
numbers are multiplied together. The number of partial products 
will depend; on the method used for obtaining the partial 
products. IrL a conventional operation, the number of partial 
products may be equal to the number of bits in the multiplier. 
However, techniques such as Booth encoding make it possible to 
reduce the number of partial products obtained. For example Booth 
coding allowfe the number of partial products to be reduced by a 
ractor of 2. The Eocth enccding method is sometimes referred to 
as the Booth,- MacScr ley algorithm. 

In the Booth coding method, a triplet of bits of the multiplier 
is input to a Booth coder which provides three outputs, the 
values of whilch depends or. the input values. The outputs of the 
Booth coder !are then used to modify bits of the number to be 
multiplied by the multiplier. 

Carry save adders which reduce three partial products to two 
partial products are known. This type of carry save adder is 
sometimes referred to as a 3 to 2 carry save adder. Such a carry 
save adder is in fact a full adder. A full adder is a binary 
logic circuit which produces a two-bit sum where one bit 
represents the sum and the other bit represents the carry when 
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three one b^t binary numbers are added togecher. The three one 
ait numbers may be corresponding bics from three partial 
products. 

Carry save adders which reduce four partial products to two 
partial products are also known. This type of carry save adder is 
known as a 4 to 2 carry save adder. Cne example of such a carry 
save adder i;s shown in the IEEE Journal of Solid State circuits, 
Vol 30, No '3, March 1995: "A 4.4ns CMOS 54x54 -b using pass- 
transistor mjltiplexer" , page 251 tc 257, N ohkubo et al . This 
structure generally yields faster partial producr compression 
than the usej of 3 to 2 carry save adders . The 4 to 2 carry save 
adder actually compresses five partial products into three and is 
therefore so.net imes referred to as a 5 to 2 carry save adder. 
This carry s<::ve adder is connected in such a way that four of the 
inputs are corresponding bits from four partial products whilst 
the fifth input is fed from a neighbouring adder and is known as 
the "carry in". The output of this carry save adder consists of 
one sum bit «.nd two carry bits. One of these carry bits is input 
to a neighbouring carry save adder and forms one of the five 
inputs of that carry save adder, 
i 

Summary of the Invention 

It is an aim of embodiments of the present invention to provide a 
carry save adder unit which is capable of compressing more than 
five inputs t;o a smaller number of outputs. 

According tc one aspect of the present invention, zhere is 
provided a c<=.rry save adder for reducing the number of inputs tc 
a lower number of outputs, said carry save adder comprising four 
carry save adders, said four carry save adders being arranged in 
two layers with the first and second carry save adders being 
arranged in el first of said layers and the third and fourth carry 
save adders being arranged in a second of said layers, said third 



and fourth ::arry save adders being arranged to provide said 
outputs, said third and fourth carry save adders each receiving 
at least one output from each of said first and second carry save 
adders and t.rie first and second carry save adders being arranged 
to receive a'.: least some of said inputs. 

This arrangement permits a relatively shore routing between the 
carry save adders to be achieved. This may also improve timing. 

At least one of the inputs may be input to at least one of the 

third and fourth carry save adders, 

i 
i 

i 

The carry seive adder circuit may be a 3 to 4 carry save adder 
circuit. The 1 first carry save adder may be a 5 to 3 carry save 
adder and the second, third and fourth carry save adders may be 3 
to 2 carry s«ve adders. The first to fifth inputs may be input to 
the first carry save adder and the sixth to eighth inputs may be 
provided to the second carry save adder. The ninth input may be 
input, directly to one of rhe third and fourth carry save adders. 

The carry save adder circuit may alternatively be a 7 to 4 carry 
save adder circuit. The firsc to third carry save adders may be 3 
to 2 carry save adders and the fourth carry save adder may be a 
half adder. ::he first to third inputs may be input to the first 
carry save adder and the fourth to sixch inputs may be provided 
to the second carry save adder. The seventh input may be input 
directly to :me of the third and fourth carry save adders. 

According to a second aspect of the present invention there is 
provided a curry save adder circuit for reducing nine inputs to 
four outputs, said carry save adder circuit comprising four carry 
save adders, the first carry save adder being a 5 zo 3 carry save 
unit and the second, third and fourth carry save adders being 3 
to 2 carry snve adders. 
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According tc a third aspect cf the present invention chare is 
provided a curry save adder circuit for reducing seven inputs to 
four outputs , said carry save adder circuit comprising four carry 
save adders, the first, second and third carry save adders being 
3 to 2 carry save adders and the fourth carry save unit being a 
half adder. 

According to a fourth aspect of the present invention there may 
be provided an arithmetic unit for processing a plurality of 
partial products, said unit comprising a plurality of carry save 
adder circuits as claimed in any preceding claim, wherein the 
inputs to each of said carry save adder circuits are provided by 
said plurality of partial products. 

i 

Brief Description of the Drawings 

For a better : understanding of the present invention and as to how 
the same may be carried into effect, reference will now be mads 
by way of example to the accompanying drawings in which: 

Figure 1 shows a 3 to 2 carry save adder; 
Figure 2 shoi/s a 5 to 3 carry save adder; 

Figure 3 shows three 5 to 3 carry save adders connected together 
in series and used as 4 to 2 carry save adders; 
Figure 4 shows a 9 to 4 carry save adder embodying the present 
invention ; 

Figure 5 shows a arithmetic unit embodying the present invention; 
Figure 6 shows a 1 to 4 carry save adder embodying the present 
invention ; 

Figure 7 shows one possible structure of a 3 to 2 carry save 
adders- 
Figure S shows one possible structure of a 5 to 3 carry save 
adder; and 

Figure 9 shojs the construction of the multiplexers of Figure 3 
in more detail , 



Description of the Preferred Embodiments 



Reference will first be made to Figure 1 which shows a 
conventional 3 to 2 carry save adder 2. It should be appreciated 
that carry save adders are scrrerimes referred to as compressors . 
The carry save adder 2- cf Figure 1 is a conventional full adder 
and has thr<=e inputs A, B and C. Inputs A, B and C are to be 
added together. The carry save adder 2 cf Figure 1 has two 
outputs S and Cout. The first output S is the sum curpur and the 
second output Cout is the carry output. The first output S 
represents tlie least significant bit of the sum of A and 3 and C 
whilst the second output Cout represents the most significant bit 
of the sum c.E A and B and C. 

The truth table for the carry save adder of Figure 1 is as 
follows : - 
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Figure 2 shows a 5 to 3 carry save adder. As mentioned 
hereinbefore, this carry save adder is sometimes referred to as a 



4 to 2 carry- save adder. The carry save adder 4 of Figure 2 has 
five inputs, inputs A, 3, C, D and CI. A, B, C and D are to be 
added together along with CI which represents a carry input. 
Where a number of adders are connected, the inpui CI will be 
provided by t:he output of a preceding carry save adder. 

The carry sat-e adder 4> of Figure 2 has three outputs, the first 
output S, the second output C01 and -he third output CC2. The 
first output S, which is the sum output, represents the least 
significant bit of the sum of A + B + C + D r CI. The other two 
output signals CC1 and C02 are the carry signals. These carry out 
signals are same as the carry out signals for a full adder except 
that there are two carry out signals which have the same 
significance (2 1+1 if the inputs have the significance 2 1 ) . 

Reference is now rttade to Figure 4 which shows a 3 io 4 carry save 
adder 20 embodying the present invention. The carry save adder 20 
embodying th=> present invention comprises first ~o fourth carry 
save adders 2 2 to 2 8 respectively. The first carry save adder 22 
is a 5 to 3 carry save adder of the type shown in Figure 2. The 
second, third and fourth carry save adders 24, 25 and 28 are 3 to 
2 carry save adders or full adders as shown in Figure 1. 

The first cai:ry save adder 22 receives the firsc five inputs Il- 
ls. These first five inputs represent bits from the first to 
fifth partial products. The second carry save adder 24 receives 
the sixth, seventh and eighth inputs IS, 17 and IS representing 
bits from the sixth to eighth partial products. 

The sum output S of the first carry save adder 2 2 provides the 
third input c of rhe fourth carry save adder 23. The first carry- 
output C01 of the first carry save adder 22 provides the third 
input c of the third carry save adder 26. The second carry output 
C02 of the first carry save adder 22 provides the second input b 
of the third carry save adder 26. 
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The sun output S of the second carry save adder 24 provides the 
second input h of the fourth carry save adder 28. The carry 
output CC1 c ; : the second carry save adder 24 provides the first 
input a of the third carry save adder. 

The third carry save adder 26 therefore receives all three of its 
inputs a zo c from the outputs of the first and second carry save 
adders 22 arid 24. The fourth carry save adder 28 thus has its 
second and t.iird inputs b and c provided by the outputs of the 
first and second carry save adders 22 and 24. The first input a 
of the fourth carry save adder 28 is provided by the nintn input 
19 from zhe ninth partial product. 

The fourth ' carry save adder 2 8 provides a sum output S 
(significance 2 1 ) and a first carry C02A of significance 
The zr.Lrd carry save adder 76 generates a second carry C02B of 
significance- 2 i+1 and a third carry C04 of significance 2 1 " 2 . This 
assurr.es that' each of the inputs II to 19 have a 2 1 significance. 

It should bej appreciated that the interconnections described in 
relation to !?igure 4 is only one way of achieving the required 
function. For example the inputs of cell 2 6 can be swapped or the 
inputs of uhe cell 28 can be swapped. 

The carry Scjve adder 20 of Figure 4 has two inputs which are 
faster than the others. This has the advantage that when .some of 
the parrial products are generated later compared to' others, the 
partial products which are generated later can be input to the 
faster inputs. In this way the timing differences between the 
partial products can be reduced or avoided. The ninth input 19 is 
fastest as ;it cresses only one cell or carry save adder. The 
fifth input 15 which is the carry input CI to the first carry 
save adder 22 is also faster than the other inputs to that same 
carry save adder or to the second carry 3ave adder 24. This will 



be discussed in more detail in relation to Figure 8 later. 



In preferred, embodiments of the present invention, the carry save 
adder 20 can be laid our in such a way as to reduce the number of 
wires crossj.ng large distances. This is because there are only 
four cells and they can be arranged next: to each other in a 
compact arrangement. Wich this arrangement the routing problems 
can be reduced as compared to structures such as the Wallace 
tree . 

Reference is made to Figure 5 which shows a context in which the 
carry save £|ider 20 of Figure 4 can be used. 

In the embodiment shown in Figure 4, a further carry save adder 
may be addedl This further carry save adder may be a 4 to 2 carry 
save adder vrhich generates two partial products. This 4 to 2 
carry save adder will receive four bits of the same significance. 
Assuming that the carry save adder deals with bits of 
significance, the carry save adder will receive S from column 1/ 
the output C02A and C02B from the previous column 1-1 and the 
oucput C04 from the column 1-2. 

Figure 5 shows an arithmetic unit 3 0 which is arranged to 
multiply twc' numbers, X and Y together. Y is the multiplier in 
this case. !f has 17 bits and is subject to Booth coding. Y 
represents a 16 bit number with the 17 rh bit being a 1-bit 
extension: j=ign extension if the 16 bit data is signed, 0 
extension if it is unsigned. This enables Y to be considered as a 
17 bit signed data indicating if the dar.a is signed or unisigned. 
The Booth coding of the Y value results in outputs which are used 
to modify the X values. As mentioned hereinbefore, Booth coding 
allows the number of partial products to be reduced. For example, 
if Y is 17 bits long, nine partial products will result from the 
multiplication of those two numbers, if Booth coding is used. The 
nine partial products are referred to by references P0 to P3. 
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Each partial product ?0 co ?3 has n bits. One bit from each of 
the partial products is input tc the carry save adder 2C of 
Figure 4. If; the mth bit of the first partial product- PC is 
selected, the m-2 th bit will be selected from the second partial 
product PI , the m-4 ch bit selected from the third partial product 
P2 and so on In this scenario, the first bit in a given partial 
product is t ne least significant bit whilst the nth bit in the 
partial product is the most significant bit. In certain cases, 
there will be no bits available. For example, if m=4 , the 
corresponding bit from the fourth partial product P3 would be r=- 
2. This gives; a negative value. In these circumstances the value 
zero would be taken as the input to the carry save adder. 
Effectively i;he values of the partial products are offset so as 
to take into account the significance or weight of each of the 
partial product. 

The number of: carry save adders 20 which are provided is defined 

by the following equation: 

number of bits = n + 2 (p-L) where 

n is the number of bits in each partial products 

p is the number of partial products. 

As discussed: above , there are not always 9 bits. For example 
column 5 only has three bits. In those circumstances it is not 
necessary to use the 9 tc 4 adder. Instead, a 3 to 2 carry save 
adder can he used. Accordingly, the carry save adder for each 
column can he selected in accordance with the number cf bits 
which are tc be added. 

Each carry jsiave adder provides four outputs as discussed in 
relation to Figure 4. The sum output S of the third carry save 
adder 26 provides one bit of a first partial product 34, the 
second output: C02A of the third carry save adder 26 provides one 
bit of a second partial product 36, the first output C02B of the 



10 

fourth carry save adder 28 provides cne bit of the third partial 
product 3 6 t.-d the second output C04 of the fourth carry save 
adder 2 8 provides cne bit of a fourth partial product fr D . The 
first output of the carry save adder 20 provides the mth position 
of the first: partial produce 34, The second and third outputs 
C02A and CQ2;i provide the m+lth positions of the second and third 
partial products 36 and 35. The fourth output of the carry save 
adder C04 p::cvides the m+2th position of the fourth partial 
product 40. "he positions to which the outputs of the carry save 
adder 20 are provided in the respective partial products 34 to 40 
reflects their significance or weight. The outputs of adjacent 
carry save adders are at adjacent locations in the partial 
products 34 :o 40. 

These four partial products 34 to 40 which receive the outputs of 
the carry sc.ve adder 2 0 are input to respective 4 to 2 adders 
which reduce the number of partial products to two. These two 
partial products can then be summed to provide a single result. 

Reference is now made to Figure 6 which shows a 7 to 4 carry save 
adder 50. The 7 to 4 carry save adder 50 comprises four carry 
save adders 52 to 53. The first, second and third carry- save 
adders 52 to 5 6 are 3 to 2 adders, such as illustrated in Figure 

I. The fourth carry save adder 58 is a half adder which receives 
two inputs and provides two outputs. 

The first carry save adder 52 receives the first to third inputs 

II, 12 and 13. The second carry save adder 54 receives the fourth 
fifth and sixth inputs 14, 15 and 16. The sum output B of the 
first carry <-:ave adder 52 provides the first input a to the third 
carry save adder 56. The sum output of the second carry save 
adder 54 provides the second input b to the third carry save 
adder 56. Th5 third input to the third carry save adder 56 is 
provided by the seventh input 17. 



11 

The fourth carry save adder 53 receives the carry output C01 of 
the first ar.d the second carry save adders which provide the 
first and second inputs a and b respectively. 

The output of the third carry save adder provides the output S as 
its sum output and the output C02A as irs carry output. The sum 
output of the fourth -carry save adder 53 provides the second 
carry output C02B output whilst the carry output of the fourth 
carry save akder 53 provides the carry output C04 . These outputs 
have the sam* significance or weight as those outputs of the 9 to 
4 carry save adder shown in Figure 4. 

The seventh input 17 is faster than the other inputs in that it 
only has to pass through one of the carry save adders. The first 
and second carry save adders 52 and 54 receive the first six 
inputs 11 to '16. The first and second carry save adders 52 and 54 
effectively provide a first layer. The third carry save adder 56 
and the half adder 5S provide a second layer and receive the 
respective outputs from the first layer as well as the seventh 
input 17. In^ practice, the adders making up the carry save adder 
of Figure 6 <:an be arranged in any suitable way so as to minimise 
the length o:: the connections as with the embodiment described in 
relation to Figure 4. 

The carry save adder cf Figure 6 can also be used in a similar 
context to that shown in Figure 5 if seven partial products are 
generated, particularly if seven bits are co be processed in a 
column. The carry save adder can also be used in the embodiment 
shown in Figure 5 where- there are more than seven partial 
products but only seven bits are provided for a given column- 

It should be noted that, as with the carry save adder of Figure 
4, some of the interconnections between zhe carry save adders 
making up the 7 to 4 carry save adder can be altered without 
altering the function of the adder as a whole. For example the 
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inputs to th«: third carry save adder 56 can be swapped with ether 
inputs to that carry save adder, Likewise the inputs to the half 
adder 5 3 can be swapped around. 

Again a 4 to 2 carry save adder can be provided to reduce the 
number of partial products zo two. 

The arrangement of both the B to 4 and the 7 to 4 carry save 
adders allow short routings to be achieved between the carry save 
adders. This improves both timing and routing of these carry save 
adders. This is particularly advantageous when embodiments of the 
present invention are incorporated in an integrated circuit, 
i 

Reference is made to Figure 3 which shews three of the carry save 
adders 6a, 6b and 6c of Figure 2 connected together. Each of the 
5 to 3 carry; save adders 6 receives inputs A to D and the input 
CI from the preceding carry save adder and provides three outputs 
as in Figure 2. The first carry save adder block 6a receives a 
carry input CI from a preceding carry save adder. If there is no 
previous carry save adder, then input CI will be zero. The first 
carry save adder 6a outputs the second carry output C02 to the 
second carry! save adder 6b. This second carry output C02 from the 
first carry isiave adder is the carry input CI to the second carry 
save adder 6b. Likewise, the second carry save adder 6b provides 
its second carry output C02 as the carry input CI to the third 
carry save adder 6c. It should be noted that the input CI is not 
used to generate the C02 output of the same carry save adder. 

This means that the carry propagation is only done from one carry 
save adder t:> the next. The respective inputs A to D to each of 
the carry save adders 6 may represent adjacent bits frcm four 
partial products. For example, the first carry save adder 6a 
receives the nth bit of first, second, third and fourth partial 
products with the second and third carry save adders 6b and c 
receiving the n+1 and n+2 bits respectively of those same partial 
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products . 

Figure 3 illustrates why the 5 to 3 carry save adder is sometimes 
referred to as the 4 to 2 carry save adder. In particular four 
inputs A, B, C and D are reduced to ;wo outputs S and C01. 

Reference will now be made to Figure 1 which shows a structure of 
a 3 to 2 carry save adder 81. The carry save adder 81 has three 
inputs A, B -and C. The carry save adder 51 has first and second 
-exclusive OR gates 8 0 and 82 and firsc to fourth NAM) gates 84 to 
90. The firs); exclusive OR gate 80 receives the first input A and 
the output cj£ the second exclusive OR gate 82. The output of the 
first exclusive OR gate 80 provides the sum output S. 

The second Exclusive OR gate 82 receives the second and third 
inputs B and C as its two inputs. 

The first N.Knb gate 84 receives an input from the second and 
third inputs, B and C. The second gate BG receives its input 

from the first and third inputs A and C whilst the third NAKD 
gate 8 8 receives its inputs from the first and second inputs A 
and B. The qutput of each of the first to third NAMD gates 84 to 
8 6 provide the inputs to the fourth NAND gate 90, the output of 
which provides the carry output C. 

Reference is made to Figure 8 which shows one possibility for trie 
3 to 3 carry save adder 91. 

The 5 to 3 curry save adder 91 comprises six multiplexers 92, 93, 
94, 96 , 93 «:*xd 99 each of which has the construction shown in 
Figure 9. The first multiplexer 92 receives the fourth input 14 
and its inverse. An invercer 100 is provided for inverting the 
fourth input 14. The first multiplexer 92 also receives the third 
input 13 as a select signal. 
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The second multiplexer 93 receives the second signal 12 and its 
inverse. A second inverter 102 is provided for inverting the 
second input 12. The select signal is provided by the first input 
11. The third multiplexer 94 receives the first and third inputs 
11 and 13 arid the output of the second multiplexer 93 provides 
its select signal. The output of the third multiplexer 94 is 
input to a first buffer 104. The output cf the first buffer 104 
provides the second carry output C02 . 

The fourth multiplexer 96 receives the output of the first 
multiplexer 92 and its inverse. A third inverter 10 6 is provided 
for inverting the output of the first multiplexer S2. The output 
of the second multiplexer 93 provides the select signal for the 
fourth multiplexer 96. 

The fifth multiplexer 98 receives the carry input Ci and the 
fourth inpu: 14 as inputs and the output of the fourth 
multiplexer 96 as the select signal. The output of the fifth 
multiplexer 98 is input to a second buffer 108, the output of 
which provides the first carry output C01. 

The final multiplexer 99 receives the carry input Ci and its 
inverse. A fourth inverter 110 is provided to provide the inverse 
of the carry input cl. The select signal is provided by the 
output of the fourth multiplexer 96. The output of the sixth 
multiplexer is input to a third buffer 112, the output of which 
provides the sum output S. 

The arrangement of Figure 8 shows that the input Ci has less gate 
levels to cross that the inputs II, 12, 13 and 14 and is 
therefore faster than these latter inputs. 

Reference is made to Figure 9 which shows the construction of the 
multiplexers of Figure S in more detail. The inputs to the 
multiplexer 149 are X and y whilst the select signal is Z. The 
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multiplexer 145 comprises four pairs of transistors 150 to 156. 
Each pair o:i transistors comprises a p-type transistor 12 3 to 
126; and a r.-type transistor 12S to 134 respectively. 

In the first pair of transistors 150, the p-type transis-cr 120 
receives ths: select signal Z whilst the n-type transistxr 12S 
receives the* inverse of the select signal Z. This arrangement 
also applies to the third pair of transistors 154 comprising of 
the third p-;ype transistor 124 and the third n-type transistor 
F . 132. 

tp The second ate fourth pairs of transistors 152 and 156 are such 

^ that the n-fjpe transistors 13 0 and 134 receive the select signal 

£ Z whilst the p transistors 124 and 126 respectively receive the 

^ inverse of tr.e select signal Z. 

'•is? ; 

P circuitry for the carry save adders is by way of example only 

and any other suitable circuitry can be used. 

jh ; Embodiments of the present invention may be incorporated in a 

digital signal processor. Embodiments of the present invention 
may be incorporated in an integrated circuit. 



